PyTorch implementation based on the original Transformer architecture from the 2017 paper 'Attention Is All You Need', a 65-million-parameter base model specifically trained for English-German translation tasks
Machine Translation Supports Multiple Languages